StAR HOMOLOGUES



FIELD OF THE INVENTION 

The present invention concerns novel nucleic acid sequences, vectors and
host cells containing them, amino acid sequences encoded by said sequences, and
antibodies reactive with said amino acid sequences, as well as pharmaceutical
compositions comprising any of the above. The present invention further concerns
methods for screening for candidate activators or deactivators utilizing said
amino acid sequences.

BACKGROUND OF THE INVENTION 

The following publications are believed to be relevant as Background of the 
Invention: 

(1) Douglas, M.S. and Clark, B.J., Steroids, 62:29-36 (1997).

(2) Miller, W.L., J. Steroid Biochem. Molec. Biol., 55(5,6):607-616
(1995).

(3) Kallen, et al., Molecular and Cellular Endocrinology, 145:39-45
(1998).

(4) Gharani, et al., Human Molecular Genetics, 6(3):397-402 (1997).

(5) Legro, R.S., Molecular and Cellular Endocrinology, 145:103-110
(1998).

Steroid hormones produced in the placenta, adrenals, ovaries, and gonads 
are important mediators of tissue differentiation, development, and homeostasis. 
The synthesis of these important mediators of both genomic and non-genomic
effects is tightly regulated in a temporal and tissue-specific manner. The acute
stimulation of steroidogenesis in the adrenals and gonads is triggered by trophic
hormone-induced generation of cAMP with subsequent activation of
phosphorylation and gene transcription by the cAMP-dependent protein kinase A
(PKA). The rate-limiting step in steroid hormone synthesis is the side-chain
cleavage of cholesterol to form pregnenolone, a step catalyzed by the side-chain
cleavage enzyme system (P450scc), which is located on the matrix side of the inner
mitochondrial membrane.

The translocation of cholesterol from the cholesterol-rich outer 
mitochondrial membrane to the cholesterol-poor inner membrane is the true
rate-limiting step in the acute control of steroidogenesis. Consistent with this
proposition is the finding that freely diffusible cholesterol analogs (hydroxysterols)
that are able to reach the inner mitochondrial membrane promote maximal levels of
steroidogenesis in the absence of trophic stimulation. It has also been shown that
the acute steroidogenic response is blocked within minutes of treatment with
inhibitors of protein synthesis (e.g. cycloheximide or puromycin). Collectively,
these observations suggested a model in which trophic hormone, through the
intermediacy of cAMP, promotes the de novo synthesis of a labile or short-acting
protein which functions to enhance transport of cholesterol from the outer to the
inner mitochondrial membrane, where it serves as substrate for the P450scc enzyme
system.

The protein responsible for this transport is termed "steroidogenic acute
regulatory protein" (StAR).

The importance of StAR in the regulation of steroidogenesis has been 
dramatically demonstrated in recent studies on the disease congenital lipoid adrenal
hyperplasia (lipoid CAH). Lipoid CAH is a lethal condition that results from a
complete inability of the newborn infant to synthesize steroids. The lack of
mineralocorticoids and glucocorticoids results in death within days to weeks of
birth if not detected and treated with adequate steroid hormone and salt
replacement therapy. It was demonstrated that lipoid CAH is due to defects in StAR
expression or function.

The human StAR cDNA encodes a 285 amino acid protein with 37 kDa 
mobility on SDS polyacrylamide gels (SDS-PAGE). The StAR protein sequence is
highly conserved (80-88% identity and 90% similarity) across all of the species
thus far compared, including the mouse, human, hamster, rat, cow, sheep and pig.
The protein contains putative phosphorylation sites for several protein kinases,
including protein kinase A (PKA), calmodulin-dependent kinase II (CaM
kinase II), and protein kinase C (PKC). The amino terminal 33 amino acids of the
predicted polypeptide possess a high net positive charge and the potential to form
an amphipathic helical structure typical of mitochondrial signal sequences. Both
murine and human StAR proteins are efficiently imported and processed by
mitochondria of steroidogenic and non-steroidogenic cells.

Polycystic ovary syndrome (PCOS) is a highly prevalent endocrine disorder
which is characterized by hyperandrogenaemia and represents the most common
cause of anovulatory infertility and hirsutism. PCOS has been estimated to have a
population prevalence of between 5% and 10%. The characteristic polycystic ovarian
morphology, however, may be found in up to 22% of the normal population, with
more than 90% of these women having at least one mild symptom that may be
considered a clinical marker of PCOS.

Hyperandrogenaemia is seen both in women with PCOS and in men with
premature male pattern baldness, suggesting an underlying disorder of androgen
biosynthesis or metabolism. Androgens are synthesized by the adrenals, the theca
cell layer of the developing ovarian follicle and the testicular Leydig cells. Both
scalp hair loss and hirsutism are known to be mediated by androgens. The
sensitivity of the hair follicle to androgens is dependent on a number of factors,
such as serum concentrations of bioavailable androgens and the presence and
number of androgen receptors.
It has been shown that theca cells from polycystic ovaries show a significant
increase in both androstenedione and progesterone production in vitro when
compared to normal theca. There have been indications that a marker, CYP11a,
which has been localized to chromosome 15q23-q24, is associated with the
Stein-Leventhal syndrome (polycystic ovary syndrome, PCOS).

In addition, androgens have been known to play a major role in the
regulation of various aspects of the biology of prostate cancer cells.

GLOSSARY 

In the following description and claims use will be made, at times, of a
variety of terms, and the meaning of such terms as they should be construed in
accordance with the invention is as follows:

"Steroidogenic acute regulatory protein-homolog (StAR-B) nucleic acid
sequence" - the sequence shown in SEQ ID NO: 1, sequences having at least 70%
identity to said sequence, and fragments of the above sequences being at least
20 b.p. long. This sequence is a sequence coding for a novel homolog of the known
StAR protein. The two sequences share homology at the C-terminus, which is the
region responsible for the physiological and catalytic activity of StAR. However,
the term StAR-B does not necessarily signify that the StAR-B protein coded by the
above sequence has the same or even similar physiological effects as known StAR,
merely that it shows sequence homology with the known StAR.

"Steroidogenic acute protein (StAR-B product)" - also referred to at times as
the "StAR-B protein" or "StAR-B polypeptide" - is an amino acid sequence shown in
SEQ ID NO: 2. The amino acid sequence may be a peptide, a protein, as well as
peptides or proteins having chemically modified amino acids (see below) such as a
glycopeptide or glycoprotein. An example of a StAR-B product is shown in
SEQ ID NO: 2. The term also includes analogues of said sequences in which one or
more amino acids has been added, deleted, substituted (see below) or chemically
modified (see below), as well as fragments of this sequence having at least 10
amino acids.

"Nucleic acid sequence" - a sequence composed of DNA nucleotides, RNA
nucleotides or a combination of both types, and may include natural nucleotides,
chemically modified nucleotides and synthetic nucleotides.

"Amino acid sequence" - a sequence composed of any one of the 20 naturally
occurring amino acids, amino acids which have been chemically modified (see
below), or composed of synthetic amino acids.

"Fragment of StAR-B product" - a polypeptide which has an amino acid
sequence which is the same as part of, but not all of, the amino acid sequence of
the StAR-B product.

"Fragments of StAR-B nucleic acid sequence" - a continuous portion, preferably
of about 20 nucleotides, of the StAR-B nucleic acid sequence.

"Conservative substitution" - refers to the substitution of an amino acid in one
class by an amino acid of the same class, where a class is defined by common
physicochemical amino acid side chain properties and high substitution
frequencies in homologous proteins found in nature, as determined, for example,
by a standard Dayhoff frequency exchange matrix or BLOSUM matrix. Six
general classes of amino acid side chains have been categorized and include:
Class I (Cys); Class II (Ser, Thr, Pro, Ala, Gly); Class III (Asn, Asp, Gln, Glu);
Class IV (His, Arg, Lys); Class V (Ile, Leu, Val, Met); and Class VI (Phe, Tyr,
Trp). For example, substitution of an Asp for another class III residue such as
Asn, Gln, or Glu, is a conservative substitution.
"Non-conservative substitution" - refers to the substitution of an amino acid in
one class with an amino acid from another class; for example, substitution of an
Ala, a class II residue, with a class III residue such as Asp, Asn, Glu, or Gln.
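
The six side-chain classes listed above can be expressed as a simple lookup. The
following is a minimal sketch only: the names `SIDE_CHAIN_CLASSES`,
`residue_class` and `is_conservative` are illustrative assumptions and do not
appear in the specification, and residues are given as three-letter codes.

```python
# Six side-chain classes as listed in the glossary (three-letter codes).
# The dictionary and function names are hypothetical, chosen for illustration.
SIDE_CHAIN_CLASSES = {
    "I": {"Cys"},
    "II": {"Ser", "Thr", "Pro", "Ala", "Gly"},
    "III": {"Asn", "Asp", "Gln", "Glu"},
    "IV": {"His", "Arg", "Lys"},
    "V": {"Ile", "Leu", "Val", "Met"},
    "VI": {"Phe", "Tyr", "Trp"},
}


def residue_class(residue: str) -> str:
    """Return the class label (I to VI) of a three-letter residue code."""
    for label, members in SIDE_CHAIN_CLASSES.items():
        if residue in members:
            return label
    raise ValueError(f"unknown residue: {residue!r}")


def is_conservative(original: str, replacement: str) -> bool:
    """True when both residues fall in the same side-chain class.

    An unchanged residue trivially falls in its own class, so callers that
    care about real substitutions should check original != replacement first.
    """
    return residue_class(original) == residue_class(replacement)


if __name__ == "__main__":
    print(is_conservative("Asp", "Asn"))  # both class III
    print(is_conservative("Ala", "Asp"))  # class II versus class III
```

The two calls mirror the examples given in the definitions: Asp to Asn stays
within class III, whereas Ala to Asp crosses from class II to class III.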

"Chemically modified" - when referring to the product of the invention, means a
product (protein) where at least one of its amino acid residues is modified either
by natural processes, such as processing or other post-translational
modifications, or by chemical modification techniques which are well known in the
art. Among the numerous known modifications, typical but not exclusive examples
include: acetylation, acylation, amidation, ADP-ribosylation, glycosylation, GPI
anchor formation, covalent attachment of a lipid or lipid derivative, methylation,
myristoylation, pegylation, prenylation, phosphorylation, ubiquitination, or any
similar process.

"Biologically active" - refers to a StAR-B product which has regulatory or
biochemical functions on the same target sites which naturally occurring StAR-B
influences, for example an effect in regulation of steroid synthesis, in particular
androgen synthesis. This term may refer to the same biological activity as
manifested in different cells in different tissues which are responsible for
hyperandrogenism, such as theca cells, Leydig cells and cells of the adrenal
cortex.

"Immunologically active" - defines the capability of a natural, recombinant or
synthetic StAR-B product, or any fragment thereof, to induce a specific immune
response in appropriate animals or cells and to bind with specific antibodies.
Thus, for example, an immunologically active fragment of StAR-B product denotes a
fragment which retains some or all of the immunological properties of the
StAR-B product, e.g. can bind specific anti-StAR-B product antibodies or
can elicit an immune response which will generate such antibodies or cause
proliferation of specific immune cells which produce StAR-B.

"Optimal alignment" - is defined as an alignment giving the highest percent
identity score. Such alignment can be performed using a variety of commercially
available sequence analysis programs, such as the local alignment program
LALIGN using a ktup of 1, default parameters and the default PAM. A preferred
alignment is the one performed using the CLUSTAL-W program from
MacVector (TM), operated with an open gap penalty of 10.0, an extended gap
penalty of 0.1, and a BLOSUM similarity matrix. If a gap needs to be inserted
into a first sequence to optimally align it with a second sequence, the percent
identity is calculated using only the residues that are paired with a corresponding
amino acid residue (i.e., the calculation does not consider residues in the second
sequence that are in the "gap" of the first sequence).

"Having at least X% identity" - with respect to two amino acid or nucleic acid
sequences, refers to the percentage of residues that are identical in the
two sequences when the sequences are optimally aligned. Thus, 70% amino acid
sequence identity means that 70% of the amino acids in two or more optimally
aligned polypeptide sequences are identical.
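
The percent identity rule stated in the two definitions above (count only
positions where a residue is paired with a residue, ignoring gap positions) can
be sketched as follows. This is an illustration under stated assumptions: the
function name `percent_identity` and the `-` gap character are choices made
here, and the sketch takes two sequences that have already been aligned by a
program such as those named above; it does not perform the alignment itself.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over paired (non-gap) columns of a pairwise alignment.

    Both inputs must be rows of the same alignment, so they must have equal
    length. Columns in which either row holds the gap character are skipped,
    following the rule that only paired residues are counted.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    # Keep only the columns where both rows contain a residue.
    paired = [(a, b) for a, b in zip(aligned_a, aligned_b)
              if a != gap and b != gap]
    if not paired:
        return 0.0

    matches = sum(1 for a, b in paired if a == b)
    return 100.0 * matches / len(paired)


if __name__ == "__main__":
    # Hypothetical toy alignment: five paired columns, four of them identical.
    print(percent_identity("ACGT-A", "ACCTGA"))
```

In the toy alignment the fifth column is a gap in the first row and is excluded,
leaving five paired columns of which four match.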

"Isolated nucleic acid molecule having an StAR-B nucleic acid sequence" - is 
a nucleic acid molecule that includes the coding StAR-B nucleic acid sequence. 
Said isolated nucleic acid molecule may include the StAR-B nucleic acid
sequence as an independent insert; may include the StAR-B nucleic acid
sequence fused to an additional coding sequence, encoding together a fusion
protein in which the StAR-B coding sequence is the dominant coding sequence
(for example, the additional coding sequence may code for a signal peptide); the
StAR-B nucleic acid sequence may be in combination with non-coding
sequences, e.g. introns or control elements, such as promoter and terminator
elements or 5' and/or 3' untranslated regions, effective for expression of the
coding sequence in a suitable host; or may be a vector in which the StAR-B
protein coding sequence is heterologous.

"Expression vector" - refers to vectors that have the ability to incorporate and
express heterologous DNA fragments in a foreign cell. Many prokaryotic and
eukaryotic expression vectors are known and/or commercially available.
Selection of appropriate expression vectors is within the knowledge of those
having skill in the art.

"Deletion" - is a change in either nucleotide or amino acid sequence in which
one or more nucleotides or amino acid residues, respectively, are absent.

"Insertion" or "addition" - is that change in a nucleotide or amino acid
sequence which has resulted in the addition of one or more nucleotides or amino
acid residues, respectively, as compared to the naturally occurring sequence.

"Substitution" - replacement of one or more nucleotides or amino acids by
different nucleotides or amino acids, respectively. As regards amino acid
sequences, the substitution may be conservative or non-conservative.

"Antibody" - refers to IgG, IgM, IgD, IgA, and IgE antibody. The definition
includes polyclonal antibodies or monoclonal antibodies. This term refers to
whole antibodies or fragments of the antibodies comprising the antigen-binding
domain of the anti-StAR-B product antibodies, e.g. antibodies without the Fc
portion, single chain antibodies, fragments consisting of essentially only the
variable, antigen-binding domain of the antibody, etc.

"Activator" - as used herein, refers to a molecule which mimics the effect of the
natural StAR-B product or at times even increases or prolongs the duration of the
biological activity of said product, as compared to that induced by the natural
product. The mechanism may be by binding to the inner mitochondrial membrane
thus increasing the activity of StAR-B, by prolonging the lifetime of the StAR-B,
by increasing the activity of the StAR-B on its target (transport of cholesterol), by
increasing the affinity of StAR-B to moieties which it binds (such as cholesterol),
etc. Activators may be polypeptides, nucleic acids, carbohydrates, lipids, or
derivatives thereof, or any other molecules which can bind to and activate the
StAR-B product.

"Deactivator" (or "Inhibitor") - refers to a molecule which modulates the
activity of the StAR-B product in an opposite manner to that of the activator, by
decreasing or shortening the duration of the biological activity of the StAR-B
product. This may be done by blocking the binding of the StAR-B to cholesterol
(competitive or non-competitive inhibition), by causing rapid degradation of the
StAR-B, by inhibiting association of the StAR-B with the inner membrane of
the mitochondria, etc. Deactivators may be polypeptides, nucleic acids,
carbohydrates, lipids, or derivatives thereof, or any other molecules which bind to
and modulate the activity of said product.

"Treating a disease" - refers to administering a therapeutic substance effective
to ameliorate symptoms associated with a disease, to lessen the severity or cure
the disease, or to prevent the disease from occurring.
"Detection" - refers to a method of detection of a disease. This term may refer to
detection of a predisposition to a disease.

"Probe" - the StAR-B nucleic acid sequence, or a sequence complementary 
therewith, when used to detect presence of other similar sequences in a sample.
The detection is carried out by identification of hybridization complexes between 
the probe and the assayed sequence. The probe may be attached to a solid support 
or to a detectable label. 

SUMMARY OF THE INVENTION 
The present invention is based on the surprising finding that there exists in
humans a novel homolog of StAR having a significant homology to the
C-terminus of native StAR, which is the physiologically active region of that
protein. The novel homolog was termed "StAR-B". StAR-B was found to be
localized on chromosome 15q23-q24. On this locus are also localized cytochrome
P450scc variants, and the Stein-Leventhal syndrome (PCOS) was linked to this
site. It is now believed that modulation of StAR-B levels and/or activity underlies
the pathologies associated with PCOS, as well as other physiological conditions
linked to PCOS families such as male premature baldness (MPB).

Thus the present invention provides, by its first aspect, a novel isolated
nucleic acid molecule comprising or consisting of the sequence SEQ ID NO: 1,
fragments of said sequence having at least 20 nucleic acids, or a molecule
comprising a sequence having at least 70%, preferably 80%, and most preferably
90% identity to SEQ ID NO: 1.

The present invention further provides a protein or
polypeptide comprising or consisting of an amino acid sequence encoded by any of
the above nucleic acid sequences, termed herein "StAR-B product", for example,
an amino acid sequence having the sequence as depicted in SEQ ID NO: 2,
fragments of the above amino acid sequence having a length of at least 10 amino
acids, as well as homologs of the amino acid sequence SEQ ID NO: 2 in which
one or more of the amino acid residues has been substituted (by conservative or
non-conservative substitution), added, deleted, or chemically modified.

The present invention further provides a nucleic acid molecule comprising or
consisting of a sequence which encodes the above amino acid sequences
(including the fragments and analogs of the amino acid sequences). Due to the
degenerate nature of the genetic code, a plurality of alternative nucleic acid
sequences, beyond SEQ ID NO: 1, can code for the amino acid sequence of the
invention. Those alternative nucleic acid sequences which code for the same amino
acid sequences coded by the sequence SEQ ID NO: 1 are also an aspect of
the present invention.
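
The degeneracy argument above can be made concrete by counting how many
distinct coding sequences specify a given peptide under the standard genetic
code. The sketch below is illustrative only: the function names and the short
peptides are hypothetical, SEQ ID NO: 1 and SEQ ID NO: 2 are not reproduced
here, and the standard (nuclear) codon table is assumed.

```python
from itertools import product
from math import prod

# Standard genetic code, written in the conventional T, C, A, G order
# (first base varies slowest, third base fastest). '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate an in-frame, upper-case DNA coding sequence codon by codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]]
                   for i in range(0, len(dna) - len(dna) % 3, 3))


def synonymous_codons(amino_acid: str) -> list:
    """All codons that encode the given one-letter amino acid."""
    return [c for c, aa in CODON_TABLE.items() if aa == amino_acid]


def count_encodings(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the peptide."""
    return prod(len(synonymous_codons(aa)) for aa in peptide)


if __name__ == "__main__":
    print(translate("ATGCTG"))      # hypothetical two-codon sequence
    print(count_encodings("MLS"))   # Met has 1 codon, Leu 6, Ser 6
```

Because the count is a product over residues, even a short peptide is encoded by
many alternative nucleic acid sequences, which is the point the paragraph makes
about sequences beyond SEQ ID NO: 1.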

The present invention further provides expression vectors and cloning 
vectors comprising any of the above nucleic acid sequences, as well as host cells 
transfected by said vectors. 

The present invention still further provides pharmaceutical compositions
comprising, as an active ingredient, said nucleic acid molecules, said expression
vectors, or said protein or polypeptide.

These pharmaceutical compositions are suitable for the treatment of diseases
and pathological conditions which can be ameliorated or cured by raising the level
of the StAR-B product. Typically these are diseases which are manifested by
abnormal levels of steroid hormones (which can be higher or lower than normal
levels). The compositions are intended to restore the levels to normal. Thus
the pharmaceutical compositions may serve as alternative regimes for hormonal
administration, especially for the treatment of PCOS-involved conditions as well
as MPB.

By a second aspect, the present invention provides a nucleic acid molecule
comprising or consisting of a non-coding sequence which is complementary to that
of SEQ ID NO: 1, or complementary to a sequence having at least 70% identity to
said sequence or a fragment of said two sequences. The complementary sequence
may be a DNA sequence which hybridizes with SEQ ID NO: 1 or hybridizes
to a portion of that sequence having a length sufficient to inhibit the transcription of
the complementary sequence. The complementary sequence may be a DNA
sequence which can be transcribed into an mRNA being an antisense to the mRNA
transcribed from SEQ ID NO: 1, or into an mRNA which is an antisense to a
fragment of the mRNA transcribed from SEQ ID NO: 1 which has a length
sufficient to hybridize with the mRNA transcribed from SEQ ID NO: 1, so as to
inhibit its translation. The complementary sequence may also be the mRNA or the
fragment of the mRNA itself.
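
The relationship between a coding sequence and its complementary (antisense)
sequence can be sketched as below. This is a minimal illustration, not the
method of the invention: the function names are assumptions, the input is a
made-up four-base string rather than SEQ ID NO: 1, and only the unambiguous
bases A, C, G and T are handled.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, read 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_rna(coding_dna: str) -> str:
    """RNA complementary to the mRNA transcribed from the coding strand.

    The mRNA has the same sequence as the coding strand (with U for T), so
    its antisense is the reverse complement of the coding strand, written
    with U in place of T.
    """
    return reverse_complement(coding_dna).replace("T", "U").replace("t", "u")


if __name__ == "__main__":
    coding = "ATGC"                    # hypothetical coding-strand fragment
    print(reverse_complement(coding))  # complementary DNA strand
    print(antisense_rna(coding))       # antisense RNA
```

An antisense fragment long enough to hybridize with the transcript, as described
above, would be obtained by applying the same operation to the chosen portion of
the coding sequence.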

The nucleic acids of the second aspect of the invention may be used for
therapeutic or diagnostic applications, for example for detection of the expression
of StAR-B in various tissues such as ovary, adrenal, placenta, etc.

The present invention also provides expression vectors comprising any one
of the above defined complementary nucleic acid sequences and host cells
transfected with said nucleic acid sequences or vectors, being complementary to
those specified in the first aspect of the invention.

The invention also provides anti-StAR-B product antibodies, namely
antibodies directed against the StAR-B product which specifically bind said
StAR-B product. Said antibodies are useful both for diagnostic and therapeutic
purposes. For example, said antibody may be an active ingredient in a
pharmaceutical composition, as will be explained below.

The present invention also provides pharmaceutical compositions
comprising, as an active ingredient, the nucleic acid molecules which comprise or
consist of said complementary sequences, or of a vector comprising said
complementary sequences. The invention thus also provides
pharmaceutical compositions comprising, as an active ingredient, said anti-StAR-B
product antibodies.

The pharmaceutical compositions comprising said anti-StAR-B product
antibodies or the nucleic acid molecule comprising said complementary sequence
are suitable for the treatment of diseases and pathological conditions where a
therapeutically beneficial effect may be achieved by neutralizing the StAR-B or
decreasing the amount of the StAR-B product or blocking its binding to its effector
(cholesterol), for example, by the neutralizing effect of the antibodies, or by the
effect of the antisense mRNA in decreasing the expression level of the
StAR-B product. Mostly these diseases are manifested by abnormal levels of
steroid hormones in the diseased persons, which may be regulated to produce
normal levels by utilizing the pharmaceutical compositions of the invention. Thus
the compositions of the invention may serve as alternative treatment regimes for
hormonal administration. Preferably the pharmaceutical compositions are for the
treatment of conditions involving PCOS and MPB. By another application, the
therapeuticals may be utilized for the treatment of prostate cancer.

According to the third aspect of the invention, the present invention provides
methods for detecting the level of the transcript (mRNA) of said StAR-B product in
a body fluid sample, or in a specific tissue sample (notably the ovary, placenta,
adrenal), for example by use of probes comprising or consisting of said coding
sequences; as well as methods for detecting levels of expression of said product in
tissue, e.g. by the use of antibodies capable of specifically reacting with the above
amino acid sequences. Detection of the level of the expression of the StAR-B
variant of the invention may be indicative of a plurality of physiological or
pathological conditions.

The method, according to this latter aspect, for detection of a nucleic acid
sequence which encodes the StAR-B product in a biological sample, comprises the
steps of:

(a) providing a probe comprising at least one of the nucleic acid
sequences defined above;

(b) contacting the biological sample with said probe under conditions
allowing hybridization of nucleic acid sequences, thereby enabling formation of
hybridization complexes;

(c) detecting hybridization complexes, wherein the presence of the
complex indicates the presence of nucleic acid sequence encoding the StAR-B
product in the biological sample.

By a preferred embodiment the probe is part of a nucleic acid chip used for
detection purposes, i.e. the probe is a part of an array of probes each present in a
known location on a solid support.

The nucleic acid sequence used in the above method may be a DNA
sequence, an RNA sequence, etc.; it may be a coding sequence or a sequence
complementary thereto (for respective detection of RNA transcripts or
coding-DNA sequences). By quantification of the level of hybridization complexes
and calibration of the quantified results, it is also possible to detect the level of
the transcript in the sample.

Methods for detecting mutations in the region coding for the StAR-B
product are also provided, which may be methods carried out in a binary fashion,
namely merely detecting whether there are any mismatches between the normal
StAR-B nucleic acid sequence and the one present in the sample, or carried out by
specifically detecting the nature and location of the mutation.

The present invention also concerns a method for detecting StAR-B product,
both for determining its presence as well as its level or alterations in its level, in a
biological sample, comprising the steps of:

(a) contacting with said biological sample the antibody of the invention,
thereby forming an antibody-antigen complex; and

(b) detecting said antibody-antigen complex;

wherein the presence of said antibody-antigen complex correlates with the
presence of StAR-B product in said biological sample.

Both detection of StAR-B product and transcript, for example in urine 
samples, may be used to discriminate between various stages of prostate cancer. 
Changes in levels as compared to normal controls may be indicative of a 
pathological state. 
By yet another aspect the invention also provides a method for identifying
candidate compounds capable of binding to the StAR-B product and modulating its
activity (being either activators or deactivators). The method includes:

(i) providing a protein or polypeptide comprising an amino acid
sequence substantially as depicted in SEQ ID NO: 2, or a fragment of such a
sequence;

(ii) contacting a candidate compound with said amino acid sequence;

(iii) measuring the physiological effect of said candidate compound on
the activity of the amino acid sequences and selecting those compounds which

[...] (either the activator or deactivator) [...] StAR-B product to
cholesterol [...] modulator which changes such an activity [...] as serving
as an [...]

The present invention also concerns compounds identified by the
methods described above, which compound may either be an activator of the
StAR-B product or a deactivator thereof.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to understand the invention and to see how it may be carried out in
practice, a preferred embodiment will now be described, by way of non-limiting
example only, with reference to the accompanying drawings, in which:

Fig. 1 is the nucleic acid sequence of the invention SEQ ID NO: 1;

Fig. 2 is the amino acid sequence of the invention SEQ ID NO: 2; and

Fig. 3 is a comparison of the known StAR sequence and the StAR-B
sequence of the invention.
DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT
Example 1: StAR-B - nucleic acid sequence

The nucleic acid sequences of the invention include nucleic acid
sequences which encode StAR-B product and fragments and analogs thereof. The
nucleic acid sequences may alternatively be sequences complementary to the
above coding sequence, or to a region of said coding sequence. The length of the
complementary sequence is sufficient to avoid the expression of the coding
sequence. The nucleic acid sequences may be in the form of RNA or in the form
of DNA, and include messenger RNA, synthetic RNA and DNA, cDNA, and
genomic DNA. The DNA may be double-stranded or single-stranded, and if
single-stranded may be the coding strand or the non-coding (anti-sense,
complementary) strand. The nucleic acid sequences may also both include
dNTPs, rNTPs as well as non-naturally occurring sequences. The sequence may
also be a part of a hybrid between an amino acid sequence and a nucleic acid
sequence.

In a general embodiment, the nucleic acid sequence has at least 70%,
preferably 80% or 90% sequence identity with the sequence identified as SEQ ID
NO: 1.
The nucleic acid sequences may include the coding sequence by itself. By
another alternative the coding region may be in combination with additional
coding sequences, such as those coding for fusion protein or signal peptides, in
combination with non-coding sequences, such as introns and control elements,
promoter and terminator elements or 5' and/or 3' untranslated regions, effective
for expression of the coding sequence in a suitable host, and/or in a vector or host
environment in which the StAR-B nucleic acid sequence is introduced as a
heterologous sequence.
The nucleic acid sequences of the present invention may also have the
product coding sequence fused in-frame to a marker sequence which allows for 
purification of the StAR-B product. The marker sequence may be, for example, a 
hexahistidine tag to provide for purification of the mature polypeptide fused to 
the marker in the case of a bacterial host, or, the marker sequence may be a 
hemagglutinin (HA) tag when a mammalian host, e.g. COS-7 cells, is used. The 
HA tag corresponds to an epitope derived from the influenza hemagglutinin 
protein (Wilson, I., et al., Cell 37:767 (1984)).

Also included in the scope of the invention are fragments, also referred to
herein as oligonucleotides, typically having at least 20 bases, preferably 20-30
bases, corresponding to a region of the coding nucleic acid sequence.
The fragments may be used as probes, primers, and when complementary also as
antisense agents, and the like, according to known methods.

As indicated above, the nucleic acid sequence may be substantially as
depicted in SEQ ID NO: 1 or fragments thereof or sequences having at least 70%,
preferably 70-80%, most preferably 90% identity to the above sequence.
Alternatively, due to the degenerate nature of the genetic code, the sequence
may be a sequence coding for the amino acid sequence of SEQ ID NO: 2, or
fragments or analogs of said amino acid sequence.

A. Preparation of nucleic acid sequences

The nucleic acid sequences may
be obtained by screening cDNA libraries using oligonucleotide probes which can
hybridize to or PCR-amplify nucleic acid sequences which encode the StAR-B
products disclosed above. cDNA libraries prepared from a variety of tissues are
commercially available and procedures for screening and isolating cDNA clones
are well-known to those of skill in the art. Such techniques are described in, for
example, Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2nd
Edition), Cold Spring Harbor Press, Plainview, N.Y. and Ausubel F.M. et al.
(1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York,
N.Y.
The nucleic acid sequences may be extended to obtain upstream and
downstream sequences such as promoters, regulatory elements, and 5' and 3'
untranslated regions (UTRs). Extension of the available transcript sequence may
be performed by numerous methods known to those of skill in the art, such as
PCR or primer extension (Sambrook et al., supra), or by the RACE method
using, for example, the Marathon RACE kit (Clontech, Cat. # K1802.1).

Alternatively, the technique of "restriction-site" PCR (Gobinda et al., PCR
Methods Applic. 2:318-22, (1993)), which uses universal primers to retrieve
flanking sequence adjacent a known locus, may be employed. First, genomic
DNA is amplified in the presence of primer to a linker sequence and a primer
specific to the known region. The amplified sequences are subjected to a second
round of PCR with the same linker primer and another specific primer internal to
the first one. Products of each round of PCR are transcribed with an appropriate
RNA polymerase and sequenced using reverse transcriptase.

Inverse PCR can be used to amplify or extend sequences using divergent
primers based on a known region (Triglia, T. et al., Nucleic Acids Res. 16:8186
(1988)). The primers may be designed using OLIGO(R) 4.06 Primer Analysis
Software (1992; National Biosciences Inc., Plymouth, Minn.), or another
appropriate program, to be 22-30 nucleotides in length, to have a GC content of
50% or more, and to anneal to the target sequence at temperatures about 68-72°C.
The method uses several restriction enzymes to generate a suitable fragment in
the known region of a gene. The fragment is then circularized by intramolecular
ligation and used as a PCR template.
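
By way of illustration only, the length and GC-content criteria stated above
for primer design can be expressed as a simple screening routine. This is a
minimal sketch with hypothetical function names; it does not reproduce the
primer analysis software named in the text, and it deliberately leaves out any
annealing-temperature estimate, since the appropriate formula depends on the
reaction conditions.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in the primer (0.0 to 1.0)."""
    primer = primer.upper()
    if not primer:
        raise ValueError("primer must not be empty")
    return (primer.count("G") + primer.count("C")) / len(primer)


def primer_meets_criteria(primer: str, min_len: int = 22, max_len: int = 30,
                          min_gc: float = 0.50) -> bool:
    """Check a candidate primer against length and GC-content limits.

    The defaults follow the ranges given in the text (22-30 nucleotides,
    GC content of 50% or more).
    """
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc
```

A candidate that fails either test would simply be discarded and another
position in the known region tried.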

Capture PCR (Lagerstrom, M. et al., PCR Methods Applic. 1:111-19,
(1991)) is a method for PCR amplification of DNA fragments adjacent to a
known sequence in human and yeast artificial chromosome DNA. Capture PCR
also requires multiple restriction enzyme digestions and ligations to place an
engineered double-stranded sequence into a flanking part of the DNA molecule
before PCR.

Another method which may be used to retrieve flanking sequences is that
of Parker, J.D., et al. (Nucleic Acids Res. 19:3055-60, (1991)). Additionally, one
can use PCR, nested primers and PromoterFinder™ libraries to "walk in" genomic
DNA (PromoterFinder™; Clontech, Palo Alto, CA). This process avoids the need
to screen libraries and is useful in finding intron/exon junctions. Preferred
libraries for screening for full length cDNAs are ones that have been size-selected
to include larger cDNAs. Also, random primed libraries are preferred in that they
will contain more sequences which contain the 5' and upstream regions of genes.

A randomly primed library may be particularly useful if an oligo d(T)
library does not yield a full-length cDNA. Genomic libraries are useful for
extension into the 5' nontranslated regulatory region.

The nucleic acid sequences and oligonucleotides of the invention can also
be prepared by solid-phase methods, according to known synthetic methods.
Typically, fragments of up to about 100 bases are individually synthesized, then
joined to form continuous sequences up to several hundred bases.

B. Use of StAR-B nucleic acid sequence for the production of StAR-B
products

In accordance with the present invention, nucleic acid sequences specified
above may be used as recombinant DNA molecules that direct the expression of
StAR-B products.

As will be understood by those of skill in the art, it may be advantageous
to produce StAR-B product-encoding nucleotide sequences possessing codons
other than those which appear in SEQ ID NO:1, which are those which naturally
occur in the human genome. Codons preferred by a particular prokaryotic or
eukaryotic host (Murray, E., Nuc. Acids Res. 17:477-508, (1989)) can be
selected, for example, to increase the rate of StAR-B product expression or to
produce recombinant RNA transcripts having desirable properties, such as a
longer half-life, than transcripts produced from naturally occurring sequence.
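
By way of illustration only, substitution of host-preferred codons without
change to the encoded amino acid sequence can be sketched as follows. The
codon-to-amino-acid entries are a partial extract of the standard genetic
code; the table of preferred codons is purely hypothetical and does not
represent the measured preference of any particular host, and the function
names are likewise illustrative.

```python
# Partial standard genetic code: only the codons needed for this sketch.
CODON_TO_AA = {
    "ATG": "M",
    "CTT": "L", "CTC": "L", "CTA": "L", "CTG": "L", "TTA": "L", "TTG": "L",
    "AAA": "K", "AAG": "K",
    "GAA": "E", "GAG": "E",
    "TAA": "*", "TAG": "*", "TGA": "*",
}

# Hypothetical host preference: one preferred codon per amino acid.
PREFERRED_CODON = {"M": "ATG", "L": "CTG", "K": "AAA", "E": "GAA", "*": "TAA"}


def translate(cds: str) -> str:
    """Translate a coding sequence using the partial table above."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TO_AA[cds[i:i + 3]] for i in range(0, len(cds), 3))


def recode_for_host(cds: str) -> str:
    """Replace every codon by the (hypothetical) preferred synonymous codon."""
    return "".join(PREFERRED_CODON[aa] for aa in translate(cds))
```

Because each codon is replaced only by a synonymous codon, translating the
recoded sequence yields the same amino acid sequence as the original.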

The nucleic acid sequences of the present invention can be engineered in
order to alter a StAR-B product coding sequence for a variety of reasons,
including but not limited to, alterations which modify the cloning, processing
and/or expression of the product. For example, alterations may be introduced
using techniques which are well known in the art, e.g., site-directed mutagenesis,
to insert new restriction sites, to alter glycosylation patterns, to change codon
preference, to produce splice variants, etc.

The present invention also includes recombinant constructs comprising
one or more of the sequences as broadly described above. The constructs
comprise a vector, such as a plasmid or viral vector, into which a nucleic acid
sequence of the invention has been inserted, in a forward or reverse orientation.
In a preferred aspect of this embodiment, the construct further comprises
regulatory sequences, including, for example, a promoter, operably linked to the
sequence. Large numbers of suitable vectors and promoters are known to those
of skill in the art, and are commercially available. Appropriate cloning and
expression vectors for use with prokaryotic and eukaryotic hosts are also
described in Sambrook, et al., (supra).

The present invention also relates to host cells which are genetically
engineered with vectors of the invention, and the production of the product of the
invention by recombinant techniques. Host cells are genetically engineered (i.e.,
transduced, transformed or transfected) with the vectors of this invention which
may be, for example, a cloning vector or an expression vector. The vector may
be, for example, in the form of a plasmid, a viral particle, a phage, etc. The
engineered host cells can be cultured in conventional nutrient media modified as
appropriate for activating promoters, selecting transformants or amplifying the
expression of the StAR-B nucleic acid sequence. The culture conditions, such as
temperature, pH and the like, are those previously used with the host cell selected
for expression, and will be apparent to those skilled in the art.

The nucleic acid sequences of the present invention may be included in
any one of a variety of expression vectors for expressing a product. Such vectors
include chromosomal, nonchromosomal and synthetic DNA sequences, e.g.,
derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast
plasmids; vectors derived from combinations of plasmids and phage DNA; viral
DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. However,
any other vector may be used as long as it is replicable and viable in the host.
The appropriate DNA sequence may be inserted into the vector by a variety of
procedures. In general, the DNA sequence is inserted into an appropriate
restriction endonuclease site(s) by procedures known in the art. Such procedures
and related sub-cloning procedures are deemed to be within the scope of those
skilled in the art.

The DNA sequence in the expression vector is operatively linked to an
appropriate transcription control sequence (promoter) to direct mRNA synthesis.
Examples of such promoters include: LTR or SV40 promoter, the E. coli lac or
trp promoter, the phage lambda PL promoter, and other promoters known to
control expression of genes in prokaryotic or eukaryotic cells or their viruses.
The expression vector also contains a ribosome binding site for translation
initiation, and a transcription terminator. The vector may also include
appropriate sequences for amplifying expression. In addition, the expression
vectors preferably contain one or more selectable marker genes to provide a
phenotypic trait for selection of transformed host cells such as dihydrofolate
reductase or neomycin resistance for eukaryotic cell culture, or such as
tetracycline or ampicillin resistance in E. coli.

The vector containing the appropriate DNA sequence as described above,
as well as an appropriate promoter or control sequence, may be employed to
transform an appropriate host to permit the host to express the protein. Examples
of appropriate expression hosts include: bacterial cells, such as E. coli,
Streptomyces, Salmonella typhimurium; fungal cells, such as yeast; insect cells
such as Drosophila and Spodoptera Sf9; animal cells such as CHO, COS, HEK
293 or Bowes melanoma; adenoviruses; plant cells, etc. The selection of an
appropriate host is deemed to be within the scope of those skilled in the art from
the teachings herein. The invention is not limited by the host cells employed.

In bacterial systems, a number of expression vectors may be selected
depending upon the use intended for the StAR-B product. For example, when
large quantities of StAR-B product are needed for the induction of antibodies,
vectors which direct high level expression of fusion proteins that are readily
purified may be desirable. Such vectors include, but are not limited to,
multifunctional E. coli cloning and expression vectors such as Bluescript(R)
(Stratagene), in which the StAR-B polypeptide coding sequence may be ligated
into the vector in-frame with sequences for the amino-terminal Met and the
subsequent 7 residues of beta-galactosidase so that a hybrid protein is produced;
pIN vectors (Van Heeke & Schuster, J. Biol. Chem. 264:5503-5509, (1989)); pET
vectors (Novagen, Madison WI); and the like.

In the yeast Saccharomyces cerevisiae, a number of vectors containing
constitutive or inducible promoters such as alpha factor, alcohol oxidase and
PGH may be used. For reviews, see Ausubel et al. (supra) and Grant et al.
(Methods in Enzymology 153:516-544, (1987)).

In cases where plant expression vectors are used, the expression of a
sequence encoding StAR-B product may be driven by any of a number of
promoters. For example, viral promoters such as the 35S and 19S promoters of
CaMV (Brisson et al., Nature 310:511-514, (1984)) may be used alone or in
combination with the omega leader sequence from TMV [...] (Results Probl.
Cell Differ. 17:85-105, (1991)) may be used. These constructs can be introduced
into plant cells [...] transfection. For reviews of such techniques, see Hobbs, S.
[...] Science and Technology [...] 421-463.

StAR-B product may also be expressed in an insect system. In one such
system, [...] a nonessential [...] protein coat. The recombinant [...]
(J. Virol. 46:584, (1983); Engelhard, E.K. et al., Proc. Nat. [...]).

In mammalian host cells, a number of viral-based expression systems may
be utilized. In cases where an adenovirus is used as an expression vector, a
StAR-B product coding sequence may be ligated into an adenovirus
transcription/translation complex [...] leader sequence. Insertion in a [...]
genome will result in a viable virus capable of expressing StAR-B in infected host
cells (Logan and Shenk, Proc. Natl. Acad. Sci. 81:3655-59, (1984)). In addition,
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be
used to increase expression in mammalian host cells.

Specific initiation signals may also be required for efficient translation of
a StAR-B protein coding sequence. These signals include the ATG initiation
codon and adjacent sequences. In cases where StAR-B product coding sequence,
its initiation codon and upstream sequences are inserted into the appropriate
expression vector, no additional translational control signals may be needed.
However, in cases where only coding sequence, or a portion thereof, is inserted,
exogenous translational control signals including the ATG initiation codon
must be provided. Furthermore, the initiation codon must be in the correct
reading frame to ensure translation of the entire insert. Exogenous
translational elements and initiation codons can be of various origins, both
natural and synthetic. The efficiency of expression may be enhanced by the
inclusion of enhancers appropriate to the cell system in use (Scharf, D. et al.,
Results Probl. Cell Differ. 20:125-62, (1994); Bittner et al., Methods in
Enzymol. 153:516-544, (1987)).
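
By way of illustration only, the requirement that an exogenously supplied
initiation codon be in the correct reading frame with the inserted coding
sequence can be checked as follows. This is a minimal sketch under stated
assumptions: the function name is hypothetical, the construct is given as a
single DNA string in the sense orientation, and the first ATG in the construct
is taken to be the initiation codon.

```python
def insert_in_frame(construct: str, insert_start: int) -> bool:
    """True if the first ATG of the construct is in frame with the insert.

    construct    -- vector-derived sequence followed by the inserted coding
                    sequence, 5' to 3' (sense strand)
    insert_start -- 0-based index at which the first full codon of the
                    insert begins
    """
    atg = construct.upper().find("ATG")
    if atg == -1 or atg > insert_start:
        return False  # no initiation codon at or upstream of the insert
    # In frame when the distance from the A of ATG is a whole number of codons.
    return (insert_start - atg) % 3 == 0
```

A construct that fails this check would need one or two bases added or
removed between the initiation codon and the insert before use.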

In a further embodiment, the present invention relates to host cells
containing the above-described constructs. The host cell can be a higher
eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a
yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell.
Introduction of the construct into the host cell can be effected by calcium
phosphate transfection, DEAE-Dextran mediated transfection, or electroporation
(Davis, L., Dibner, M., and Battey, I. (1986) Basic Methods in Molecular
Biology). Cell-free translation systems can also be employed to produce
polypeptides using RNAs derived from the DNA constructs of the present
invention.

A host cell strain may be chosen for its ability to modulate the expression
of the inserted sequences or to process the expressed protein in the desired
fashion. Such modifications of the protein include, but are not limited to,
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and
acylation. Post-translational processing which cleaves a "pre-pro" form of the
protein may also be important for correct insertion, folding and/or function.
Different host cells such as CHO, HeLa, MDCK, 293, WI38, etc. have specific
cellular machinery and characteristic mechanisms for such post-translational
activities and may be chosen to ensure the correct modification and processing of
the introduced, foreign protein.

For long-term, high-yield production of recombinant proteins, stable
expression is preferred. For example, cell lines which stably express StAR-B
product may be transformed using expression vectors which contain viral origins
of replication or endogenous expression elements and a selectable marker gene.
Following the introduction of the vector, cells may be allowed to grow for 1-2
days in an enriched media before they are switched to selective media. The
purpose of the selectable marker is to confer resistance to selection, and its
presence allows growth and recovery of cells which successfully express the
introduced sequences. Resistant clumps of stably transformed cells can be
proliferated using tissue culture techniques appropriate to the cell type.

Any number of selection systems may be used to recover transformed cell
lines. These include, but are not limited to, the herpes simplex virus thymidine
kinase (Wigler M., et al., Cell 11:223-32, (1977)) and adenine
phosphoribosyltransferase (Lowy I., et al., Cell 22:817-23, (1980)) genes which
can be employed in tk- or aprt- cells, respectively. Also, antimetabolite,
antibiotic or herbicide resistance can be used as the basis for selection; for
example, dhfr, which confers resistance to methotrexate (Wigler M., et al., Proc.
Natl. Acad. Sci. 77:3567-70, (1980)); npt, which confers resistance to the
aminoglycosides neomycin and G-418 (Colbere-Garapin, F. et al., J. Mol. Biol.
150:1-14, (1981)); and als or pat, which confer resistance to chlorsulfuron and
phosphinotricin acetyltransferase, respectively (Murry, supra). Additional
selectable genes have been described, for example, trpB, which allows cells to
utilize indole in place of tryptophan, or hisD, which allows cells to utilize histidinol
in place of histidine (Hartman S.C. and R.C. Mulligan, Proc. Natl. Acad. Sci.
85:8047-51, (1988)). The use of visible markers has gained popularity with such
markers as anthocyanins, beta-glucuronidase and its substrate, GUS, and
luciferase and its substrates, luciferin and ATP, being widely used not only to
identify transformants, but also to quantify the amount of transient or stable
protein expression attributable to a specific vector system (Rhodes, C.A. et al.,
Methods Mol. Biol. 55:121-131, (1995)).

Host cells transformed with a nucleotide sequence encoding StAR-B
product may be cultured under conditions suitable for the expression and
recovery of the encoded protein from cell culture. The product produced by a
recombinant cell may be secreted or contained intracellularly depending on the
sequence and/or the vector used. As will be understood by those of skill in the
art, expression vectors containing nucleic acid sequences encoding StAR-B
product can be designed with signal sequences which direct secretion of StAR-B
product through a prokaryotic or eukaryotic cell membrane.

StAR-B product may also be expressed as a recombinant protein with one
or more additional polypeptide domains added to facilitate protein purification.
Such purification facilitating domains include, but are not limited to, metal
chelating peptides such as histidine-tryptophan modules that allow purification on
immobilized metals, protein A domains that allow purification on immobilized
immunoglobulin, and the domain utilized in the FLAGS extension/affinity
purification system (Immunex Corp., Seattle, Wash.). The inclusion of a
protease-cleavable polypeptide linker sequence between the purification domain
and StAR-B protein is useful to facilitate purification. One such expression
vector provides for expression of a fusion protein comprising a StAR-B
polypeptide fused to a polyhistidine region separated by an enterokinase cleavage
site. The histidine residues facilitate purification on IMAC (immobilized metal
ion affinity chromatography, as described in Porath, et al., Protein Expression
and Purification, 3:263-281, (1992)) while the enterokinase cleavage site
provides a means for isolating StAR-B polypeptide from the fusion protein.
pGEX vectors (Promega, Madison, Wis.) may also be used to express foreign
polypeptides as fusion proteins with glutathione S-transferase (GST). In general,
such fusion proteins are soluble and can easily be purified from lysed cells by
adsorption to ligand-agarose beads (e.g., glutathione-agarose in the case of
GST-fusions) followed by elution in the presence of free ligand.

Following transformation of a suitable host strain and growth of the host
strain to an appropriate cell density, the selected promoter is induced by
appropriate means (e.g., temperature shift or chemical induction) and cells are
cultured for an additional period. Cells are typically harvested by centrifugation,
disrupted by physical or chemical means, and the resulting crude extract retained
for further purification. Microbial cells employed in expression of proteins can
be disrupted by any convenient method, including freeze-thaw cycling,
sonication, mechanical disruption, or use of cell lysing agents, or other methods,
which are well known to those skilled in the art.

The StAR-B products can be recovered and purified from recombinant cell
cultures by any of a number of methods well known in the art, including
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation
exchange chromatography, phosphocellulose chromatography, hydrophobic
interaction chromatography, affinity chromatography, hydroxylapatite
chromatography, and lectin chromatography. Protein refolding steps can be used,
as necessary, in completing configuration of the mature protein. Finally, high
performance liquid chromatography (HPLC) can be employed for final
purification steps.

C. Diagnostic applications utilizing nucleic acid sequences

The nucleic acid sequences of the present invention may be used for a
variety of diagnostic purposes. The nucleic acid sequences may be used to detect
and quantitate expression of StAR-B in a patient's cells, e.g. biopsied tissues, by
detecting the presence of mRNA coding for StAR-B product. Alternatively, the
assay may be used to detect soluble StAR-B in the serum or blood. This assay
typically involves obtaining total mRNA from the tissue or serum and contacting
the mRNA with a nucleic acid probe. The probe is a nucleic acid molecule of at
least 20 nucleotides, preferably 20-30 nucleotides, capable of specifically
hybridizing with a sequence included within the sequence of a nucleic acid
encoding StAR-B under hybridizing conditions, detecting the presence
of mRNA hybridized to the probe, and thereby detecting the expression of
StAR-B. This assay can be used to distinguish between absence, presence, and
excess expression of StAR-B product and to monitor levels of StAR-B
expression during therapeutic intervention.

The invention also contemplates the use of the nucleic acid sequences as a
diagnostic for diseases resulting from inherited defective StAR-B sequences, or
diseases in which the purpose of the amount of the [...]
StAR variant of the invention is altered. These sequences can be detected by
comparing the sequences of the defective (i.e., mutant) StAR-B coding region
with that of a normal coding region. Association of the sequence coding for
mutant StAR-B product with abnormal StAR-B product activity may be verified.
In addition, sequences encoding mutant StAR-B products can be inserted into a
suitable vector for expression in a functional assay system (e.g., colorimetric
assay, complementation experiments in a StAR-B protein deficient strain of
HEK293 cells) as yet another means to verify or identify mutations. Once mutant
genes have been identified, one can then screen populations of interest for
carriers of the mutant gene.

Individuals carrying mutations in the nucleic acid sequence of the present
invention may be detected at the DNA level by a variety of techniques. Nucleic
acids used for diagnosis may be obtained from a patient's cells, including but not
limited to cells from blood, urine, saliva, placenta, tissue biopsy and autopsy
material. Genomic DNA may be used directly for detection or may be amplified
enzymatically by using PCR (Saiki, et al., Nature 324:163-166, (1986)) prior to
analysis. RNA or cDNA may also be used for the same purpose. As an example,
PCR primers complementary to the nucleic acid of the present invention can be
used to identify and analyze mutations in the gene of the present invention.
Deletions and insertions can be detected by a change in size of the amplified
product in comparison to the normal genotype.
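
By way of illustration only, detection of a deletion or insertion through a
change in the size of the amplified product can be sketched in silico as
follows. This is a simplified model with hypothetical function names: it
assumes exact primer matches on a single template strand and ignores
mismatches, secondary priming sites and gel resolution.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def amplicon_length(template: str, forward_primer: str, reverse_primer: str):
    """Length of the product bounded by the two primers, or None if absent.

    The forward primer is matched on the given strand; the reverse primer
    anneals to the opposite strand, so its reverse complement is searched
    for downstream of the forward primer.
    """
    template = template.upper()
    start = template.find(forward_primer.upper())
    if start == -1:
        return None
    site = reverse_complement(reverse_primer)
    end = template.find(site, start + len(forward_primer))
    if end == -1:
        return None
    return end + len(site) - start


def classify_size_change(normal_bp: int, observed_bp: int) -> str:
    """Compare an observed product size with that of the normal genotype."""
    if observed_bp == normal_bp:
        return "no size change"
    return "insertion" if observed_bp > normal_bp else "deletion"
```

In this model a product shorter than that obtained from the normal template
points to a deletion between the primers, and a longer product to an
insertion.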

Point mutations can be identified by hybridizing amplified DNA to
radiolabeled RNA of the invention or alternatively, radiolabeled antisense DNA
sequences of the invention. Sequence changes at specific locations may also be
revealed by nuclease protection assays, such as RNase and S1 protection or the
chemical cleavage method (e.g. Cotton, et al., Proc. Natl. Acad. Sci. USA,
85:4397-4401, (1985)), or by differences in melting temperatures. "Molecular
beacons" (Kostrikis L.G. et al., Science 272:1228-1229, (1998)), hairpin-shaped,
single-stranded synthetic oligonucleotides containing probe sequences which
are complementary to the nucleic acid of the present invention, may also be used
to detect point mutations or other sequence changes as well as monitor expression
levels of StAR-B product. Such diagnostics would be particularly useful for
prenatal testing.

Another method for detecting mutations uses two DNA probes which are
designed to hybridize to adjacent regions of a target, with abutting bases, where
the region of known or suspected mutation(s) is at or near the abutting bases.
The two probes may be joined at the abutting bases, e.g., in the presence of a
ligase enzyme, but only if both probes are correctly base paired in the region of
probe junction. The presence or absence of mutations is then detectable by the
presence or absence of ligated probe.
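
By way of illustration only, the logic of this two-probe ligation assay can be
modelled as follows. This is a deliberately simplified sketch: the function
and parameter names are hypothetical, each probe is represented by the target
sequence it is designed to pair with, and "correctly base paired in the region
of probe junction" is modelled as an exact match within a small window on
either side of the junction, the size of which is an assumption.

```python
def probes_ligate(target: str, left_site: str, right_site: str,
                  junction: int, window: int = 3) -> bool:
    """Model whether two abutting probes would be joined on this target.

    left_site / right_site -- target-sense sequences the probes are designed
                              to pair with, immediately 5' and 3' of the
                              junction
    junction               -- 0-based index of the first base of right_site
    window                 -- bases on each side of the junction that must
                              pair correctly (assumed value)
    """
    target = target.upper()
    if junction < len(left_site) or junction + len(right_site) > len(target):
        return False  # probes would overhang the target
    left_obs = target[junction - len(left_site):junction]
    right_obs = target[junction:junction + len(right_site)]
    return (left_obs[-window:] == left_site.upper()[-window:]
            and right_obs[:window] == right_site.upper()[:window])
```

In this model, a mutation at or adjacent to the abutting bases prevents
joining, so that the absence of ligated probe indicates a sequence change at
the junction.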

Also suitable for detecting mutations in the StAR-B product coding
sequence are oligonucleotide array methods based on sequencing by
hybridization (SBH), as described, for example, in U.S. Patent No. 5,547,839. In
a typical method, the DNA target analyte is hybridized with an array of
oligonucleotides formed on a microchip. The sequence of the target can then be
"read" from the pattern of target binding to the array.

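By way of illustration only, the way in which a pattern of target binding to
an oligonucleotide array reveals a sequence change can be sketched as follows.
This is a minimal model and not the method of the cited patent: the function
names are hypothetical, probes are written in the sense of the target for
readability, hybridization is treated as exact matching, and the array is
assumed to consist of overlapping probes tiled along the reference sequence.

```python
def tiled_probes(reference: str, k: int) -> list:
    """Overlapping probes of length k tiled along the reference sequence."""
    reference = reference.upper()
    return [reference[i:i + k] for i in range(len(reference) - k + 1)]


def binding_pattern(reference: str, target: str, k: int) -> list:
    """For each tiled probe, whether an exact match exists in the target."""
    target = target.upper()
    present = {target[i:i + k] for i in range(len(target) - k + 1)}
    return [(probe, probe in present) for probe in tiled_probes(reference, k)]


def lost_probes(reference: str, target: str, k: int) -> list:
    """Probes from the reference tiling that fail to bind the target."""
    return [probe for probe, bound in binding_pattern(reference, target, k)
            if not bound]
```

In this model, a single base change abolishes binding to the consecutive
probes that overlap the changed position, which localizes the mutation
within the reference sequence.
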
D. Gene mapping utilizing nucleic acid sequences

The nucleic acid sequences of the present invention are also valuable for
chromosome identification. The sequence is specifically targeted to and can
hybridize with a particular location on an individual human chromosome.
Moreover, there is a current need for identifying particular sites on the
chromosome. Few chromosome marking reagents based on actual sequence data
(repeat polymorphisms) are presently available for marking chromosomal
location. The mapping of DNAs to chromosomes according to the present
invention is an important first step in correlating those sequences with genes
associated with disease.

Briefly, sequences can be mapped to chromosomes by preparing PCR
primers (preferably 20-30 bp) from the StAR-B cDNA. Computer analysis of the
3' untranslated region is used to rapidly select primers that do not span more than
one exon in the genomic DNA, which would complicate the amplification
process. These primers are then used for PCR screening of somatic cell hybrids
containing individual human chromosomes. Only those hybrids containing the
human gene corresponding to the primer will yield an amplified fragment.
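
By way of illustration only, the assignment of a sequence to a chromosome
from a panel of somatic cell hybrids can be sketched as a concordance test.
This is a minimal model with hypothetical names and invented panel data in the
usage shown below: a chromosome is retained only if it is present in every
hybrid that yields an amplified fragment and absent from every hybrid that
does not.

```python
def concordant_chromosomes(panel):
    """Chromosomes whose presence matches the PCR result in every hybrid.

    panel -- iterable of (chromosomes_present, pcr_positive) pairs, where
             chromosomes_present is a set of human chromosome labels
             retained by that hybrid and pcr_positive is a bool
    """
    panel = list(panel)
    all_chromosomes = set()
    for present, _ in panel:
        all_chromosomes |= set(present)
    return sorted(
        chrom for chrom in all_chromosomes
        if all((chrom in present) == positive for present, positive in panel)
    )
```

For example, with an invented four-hybrid panel:

```python
panel = [
    ({"1", "8", "X"}, True),
    ({"8", "12"}, True),
    ({"1", "12"}, False),
    ({"X"}, False),
]
print(concordant_chromosomes(panel))
```

only chromosome 8 is concordant with the amplification results.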

PCR mapping of somatic cell hybrids, or alternatively of radiation hybrids,
is a rapid procedure for assigning a particular DNA to a particular chromosome.
Using the present invention with the same oligonucleotide primers,
sublocalization can be achieved with panels of fragments from specific
chromosomes or pools of large genomic clones in an analogous manner. Other
mapping strategies that can similarly be used to map a sequence to its chromosome
include in situ hybridization, prescreening with labeled flow-sorted chromosomes
and preselection by hybridization to construct chromosome-specific cDNA libraries.
Fluorescence in situ hybridization (FISH) of a cDNA clone to a metaphase
chromosomal spread can be used to provide a precise chromosomal location in
one step. This technique can be used with cDNA as short as 50 or 60 bases. For a
review of this technique, see Verma et al., Human Chromosomes: a Manual of
Basic Techniques, (1988) Pergamon Press, New York.

Once a sequence has been mapped to a precise chromosomal location, the
physical position of the sequence on the chromosome can be correlated with
genetic map data. Such data are found, for example, in the OMIM database
(Center for Medical Genetics, Johns Hopkins University, Baltimore, MD and
National Center for Biotechnology Information, National Library of Medicine,
Bethesda, MD). The OMIM gene map presents the cytogenetic map location of
disease genes and other expressed genes. The OMIM database provides
information on diseases associated with the chromosomal location. Such
associations include the results of linkage analysis mapped to this interval, and
the correlation of translocations and other chromosomal aberrations in this area
with the advent of polygenic diseases, such as cancer, in general and prostate
cancer in particular.

E. Therapeutic applications of nucleic acid sequences

Nucleic acid sequences of the invention may also be used for therapeutic
purposes. Turning first to the second aspect of the invention (i.e. inhibition of
expression of StAR-B), expression of StAR-B product may be modulated through
antisense technology, which controls gene expression through hybridization of
complementary nucleic acid sequences, i.e. antisense DNA or RNA, to the
control, 5' or regulatory regions of the gene encoding StAR-B product. For
example, the 5' coding portion of the nucleic acid sequence which
codes for the product of the present invention is used to design an antisense
oligonucleotide of from about 10 to 40 base pairs in length. Oligonucleotides
derived from the transcription start site, e.g. between positions -10 and +10 from
the start site, are preferred. An antisense DNA oligonucleotide is designed to be
complementary to a region of the nucleic acid sequence involved in transcription
(Lee et al., Nucl. Acids Res., 6:3073, (1979); Cooney et al., Science 241:456,
(1988); and Dervan et al., Science 251:1360, (1991)), thereby preventing
transcription and the production of the StAR-B products. An antisense RNA
oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the
mRNA molecule into the StAR-B products (Okano, J. Neurochem. 56:560,
(1991)). The antisense constructs can be delivered to cells by procedures known
in the art such that the antisense RNA or DNA may be expressed in vivo. The
antisense may be antisense mRNA or DNA sequence capable of coding such
antisense mRNA. The antisense mRNA or the DNA coding therefor can be
complementary to the full sequence of nucleic acid sequences coding for the
StAR-B protein or to a fragment of such a sequence which is sufficient to inhibit
production of a protein product.
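
By way of illustration only, an antisense DNA oligonucleotide spanning a
window around a chosen start site can be derived from a sense-strand sequence
as follows. This is a minimal sketch: the function names are hypothetical, the
sequence used in any example is invented and is not taken from SEQ ID NO:1,
and the default window of 10 bases on either side of the start site follows
the preferred range given in the text.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def antisense_oligo(sense: str, start_site: int,
                    upstream: int = 10, downstream: int = 10) -> str:
    """Antisense DNA oligonucleotide covering a window around start_site.

    sense      -- sense-strand sequence, 5' to 3'
    start_site -- 0-based index of the start site within the sense strand
    The window is clipped at the ends of the supplied sequence.
    """
    lo = max(0, start_site - upstream)
    hi = min(len(sense), start_site + downstream)
    return reverse_complement(sense[lo:hi])
```

The returned oligonucleotide is written 5' to 3' and is complementary to the
selected region of the sense strand; an antisense RNA would carry U in place
of T.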

Turning now to the first aspect of the invention, i.e. expression of StAR-B,
expression of StAR-B product may be increased by providing coding sequences
coding for said product under the control of suitable control elements enabling
its expression in the desired host.

The nucleic acid sequences of the invention may be employed in
combination with a suitable pharmaceutical carrier. Such compositions comprise
a therapeutically effective amount of the compound, and a pharmaceutically
acceptable carrier or excipient. Such a carrier includes but is not limited to saline,
buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. The
formulation should suit the mode of administration.

The products of the invention, as well as any activator and deactivator
compounds (see below) which are polypeptides, may also be employed in
accordance with the present invention by expression of such polypeptides in vivo,
which is often referred to as "gene therapy." Cells from a patient may be
engineered with a nucleic acid sequence (DNA or RNA) encoding a polypeptide
ex vivo, with the engineered cells then being provided to a patient to be treated
with the polypeptide. Such methods are well-known in the art. For example, cells
may be engineered by procedures known in the art by use of a retroviral particle
containing RNA encoding a polypeptide of the present invention.

Similarly, cells may be engineered in vivo for expression of a polypeptide
in vivo by procedures known in the art. As known in the art, a producer cell for
producing a retroviral particle containing RNA encoding the polypeptide of the
present invention may be administered to a patient for engineering cells in vivo
and expression of the polypeptide in vivo. These and other methods for
administering a product of the present invention by such method should be
apparent to those skilled in the art from the teachings of the present invention.
For example, the expression vehicle for engineering cells may be other than a
retrovirus, for example, an adenovirus which may be used to engineer cells in
vivo after combination with a suitable delivery vehicle.

Retroviruses from which the retroviral plasmid vectors mentioned above
may be derived include, but are not limited to, Moloney Murine Leukemia Virus,
spleen necrosis virus, retroviruses such as Rous Sarcoma Virus, Harvey Sarcoma
Virus, avian leukosis virus, gibbon ape leukemia virus, human immunodeficiency
virus, adenovirus, Myeloproliferative Sarcoma Virus, and mammary tumor virus.

The retroviral plasmid vector is employed to transduce packaging cell
lines to form producer cell lines. Examples of packaging cells which may be
transfected include, but are not limited to, the PE501, PA317, psi-2, psi-AM,
PA12, T19-14X, VT-19-17-H2, psi-CRE, psi-CRIP, GP+E-86, GP+envAm12,
and DAN cell lines as described in Miller (Human Gene Therapy, Vol. 1, pg.
5-14, (1990)). The vector may transduce the packaging cells through any means
known in the art. Such means include, but are not limited to, electroporation, the
use of liposomes, and CaPO4 precipitation. In one alternative, the retroviral
plasmid vector may be encapsulated into a liposome, or coupled to a lipid, and
then administered to a host.

The producer cell line generates infectious retroviral vector particles
which include the nucleic acid sequence(s) encoding the polypeptides. Such
retroviral vector particles then may be employed to transduce eukaryotic cells,
either in vitro or in vivo. The transduced eukaryotic cells will express the nucleic
acid sequence(s) encoding the polypeptide. Eukaryotic cells which may be
transduced include, but are not limited to, embryonic stem cells, embryonic
carcinoma cells, as well as hematopoietic stem cells, hepatocytes, fibroblasts,
myoblasts, keratinocytes, endothelial cells, and bronchial epithelial cells.

The genes introduced into cells may be placed under the control of
inducible promoters, such as the radiation-inducible Egr-1 promoter (Maceri,
H.J., et al., Cancer Res., 56(19):4311 (1996)), to stimulate StAR-B production or
antisense inhibition in response to radiation, e.g., radiation therapy for treating
tumors.

Example II. StAR-B product

The substantially purified StAR-B product of the invention has been
defined above as the product coded from the nucleic acid sequence of the
invention. Preferably the amino acid sequence is an amino acid sequence having
at least 70%, preferably at least 80% or 90% identity to the sequence identified as
SEQ ID NO:2. The protein or polypeptide may be in mature and/or modified
form, also as defined above. Also contemplated are protein fragments having at
least 10 contiguous amino acid residues, preferably at least 10-20 residues,
derived from the StAR-B product.
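The identity thresholds recited above presuppose some way of scoring two sequences against each other. The following Python sketch is offered as an illustration under stated assumptions only: the two sequences are assumed to have already been aligned to equal length with "-" marking gaps, and identity is taken as the number of identical columns divided by the number of aligned columns. Other definitions of percent identity exist, and the function name `percent_identity` is hypothetical.

```python
# Minimal sketch (assumed definition, not taken from the specification):
# percent identity between two pre-aligned sequences of equal length.

def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Identical columns / aligned columns * 100; '-' denotes a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns in which both sequences carry a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    reference = "AAKRTRRRNG"  # first 10 residues of SEQ ID NO:2, one-letter code
    variant = "AAKRSRRRNG"    # hypothetical variant with one substitution
    identity = percent_identity(reference, variant)
    print(f"{identity:.1f}% identity; meets 80% threshold: {identity >= 80.0}")
```

In practice the alignment itself would be produced by an established alignment program; the sketch covers only the final scoring step.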

The sequence variations are preferably those that are considered conserved
substitutions, as defined above. Thus, for example, a protein with a sequence
having at least 80% sequence identity with the protein identified as SEQ ID
NO:2, preferably by utilizing conserved substitutions as defined above, is also part
of the invention. In a more specific embodiment, the protein has or contains the
sequence identified as SEQ ID NO:2. The StAR-B product may be (i) one in which
one or more of the amino acid residues in a sequence listed above are substituted
with a conserved or non-conserved amino acid residue (preferably a conserved
amino acid residue), or (ii) one in which one or more of the amino acid residues
includes a substituent group, or (iii) one in which the StAR-B product is fused
with another compound, such as a compound to increase the half-life of the
protein (for example, polyethylene glycol (PEG)), or a moiety which serves as
targeting means to direct the protein to its target tissue or target cell population
(such as an antibody), or (iv) one in which additional amino acids are fused to the
StAR-B product. Such fragments, variants and derivatives are deemed to be
within the scope of those skilled in the art from the teachings herein.

A. Preparation of StAR-B product

Recombinant methods for producing and isolating the StAR-B product
and fragments of the protein are described above.

In addition to recombinant production, fragments and portions of StAR-B
product may be produced by direct peptide synthesis using solid-phase techniques
(cf. Stewart et al., (1969) Solid-Phase Peptide Synthesis, WH Freeman Co, San
Francisco; Merrifield J., J. Am. Chem. Soc., 85:2149-2154, (1963)). In vitro
peptide synthesis may be performed using manual techniques or by automation.
Automated synthesis may be achieved, for example, using an Applied Biosystems
431A Peptide Synthesizer (Perkin Elmer, Foster City, Calif.) in accordance with
the instructions provided by the manufacturer. Fragments of StAR-B product may
be chemically synthesized separately and combined using chemical methods to
produce the full-length molecule.

B. Therapeutic uses and compositions utilizing the StAR-B product

The StAR-B product of the invention is generally useful in treating
diseases and disorders which are characterized by a lower than normal level of
StAR-B expression, and/or diseases which can be cured or ameliorated by raising
the level of the StAR-B product, even if the level is normal.

StAR-B products or fragments may be administered by any of a number of
routes and methods designed to provide a consistent and predictable
concentration of compound at the target organ or tissue. The product-containing
compositions may be administered alone or in combination with other agents,
such as stabilizing compounds, and/or in combination with other pharmaceutical
agents such as drugs or hormones.

StAR-B product-containing compositions may be administered by a
number of routes including, but not limited to, oral, intravenous, intramuscular,
transdermal, subcutaneous, topical, sublingual, or rectal means as well as by nasal
application. StAR-B product-containing compositions may also be administered
via liposomes. Such administration routes and appropriate formulations are
generally known to those of skill in the art.

The product can be given via intravenous or intraperitoneal injection.
Similarly, the product may be injected into other localized regions of the body. The
product may also be administered via nasal insufflation. Enteral administration is
also possible. For such administration, the product should be formulated into an
appropriate capsule or elixir for oral administration, or into a suppository for
rectal administration.

The foregoing exemplary administration modes will likely require that the
product be formulated into an appropriate carrier, such as an ointment, gel, or
suppository. Appropriate formulations are well known to persons skilled in the
art.

Dosage of the product will vary, depending upon the potency and 
therapeutic index of the particular polypeptide selected. 

A therapeutic composition for use in the treatment method can include the
product in a sterile injectable solution, the polypeptide in an oral delivery vehicle,
the product in an aerosol suitable for nasal administration, or the product in a
nebulized form, all prepared according to well known methods. Such
compositions comprise a therapeutically effective amount of the compound, and a
pharmaceutically acceptable carrier or excipient. Such a carrier includes but is not
limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and
combinations thereof. The product of the invention may also be used to modulate
endothelial differentiation and proliferation as well as to modulate apoptosis
either ex vivo or in vitro, for example, in cell cultures.

Example III. Screening methods for activators and deactivators (inhibitors) 

The present invention also includes an assay for identifying molecules,
such as synthetic drugs, antibodies, peptides, or other molecules, which have a
modulating effect on the activity of the StAR-B product, e.g. activators or
deactivators of the StAR-B product of the present invention. Such an assay
comprises the steps of providing a StAR-B product encoded by the nucleic acid
sequences of the present invention, contacting the StAR-B protein with one or
more candidate molecules to determine the candidate molecules' modulating
effect on the activity of the StAR-B product, and selecting from the molecules a
candidate molecule capable of modulating the physiological activity of the
StAR-B product.
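The selection step of such an assay can be pictured with the following Python sketch, given purely as an illustration. It assumes that a numerical activity reading is available for each candidate molecule and for an untreated control, and it uses an arbitrary 1.5-fold cut-off; the `Reading` class, the `classify` function, the compound names and the threshold are all hypothetical and are not specified in this description.

```python
# Minimal sketch (hypothetical names and threshold): label candidate
# molecules by their effect on a measured StAR-B activity.
from dataclasses import dataclass


@dataclass
class Reading:
    compound: str    # identifier of the candidate molecule
    activity: float  # measured activity in arbitrary units


def classify(readings, control_activity, fold_threshold=1.5):
    """Return {compound: label} relative to the untreated control."""
    if control_activity <= 0:
        raise ValueError("control activity must be positive")
    labels = {}
    for reading in readings:
        ratio = reading.activity / control_activity
        if ratio >= fold_threshold:
            labels[reading.compound] = "activator"
        elif ratio <= 1.0 / fold_threshold:
            labels[reading.compound] = "deactivator"
        else:
            labels[reading.compound] = "no effect"
    return labels


if __name__ == "__main__":
    plate = [Reading("cmpd-A", 30.0), Reading("cmpd-B", 5.0), Reading("cmpd-C", 11.0)]
    print(classify(plate, control_activity=10.0))
```

A real screen would additionally require replicates and a statistical criterion for what counts as a significant effect.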

StAR-B product, its catalytic or immunogenic fragments or oligopeptides
thereof, can be used for screening therapeutic compounds in any of a variety of
drug screening techniques. The fragment employed in such a test may be free in
solution, affixed to a solid support, borne on a cell membrane or located
intracellularly. The formation of binding complexes between StAR-B product
and the agent being tested may be measured. Alternatively, the activator or
deactivator may work by serving as agonist or antagonist, respectively, of the
StAR-B receptor and their effect may be determined in connection with the
receptor.

Another technique for drug screening which may be used provides for
high throughput screening of compounds having suitable binding affinity to the
StAR-B product, and is described in detail by Geysen in PCT Application WO
84/03564, published on Sep. 13, 1984. In summary, large numbers of different
small peptide test compounds are synthesized on a solid substrate, such as plastic
pins or some other surface. The peptide test compounds are reacted with the full
StAR-B product or with fragments of StAR-B product and washed. Bound
StAR-B product is then detected by methods well known in the art. Substantially
purified StAR-B product can also be coated directly onto plates for use in the
aforementioned drug screening techniques. Alternatively, non-neutralizing
antibodies can be used to capture the peptide and immobilize it on a solid
support.

Antibodies to the StAR-B product, as described in Example IV below,
may also be used in screening assays according to methods well known in the art.
For example, a "sandwich" assay may be performed, in which an anti-StAR-B
antibody is affixed to a solid surface such as a microtiter plate and StAR-B
product is added. Such an assay can be used to capture compounds which bind to
the StAR-B product. Alternatively, such an assay may be used to measure the
ability of compounds to interfere with the binding of StAR-B product to the
StAR-B receptor and then to select those compounds which affect the binding.

Example IV. Anti-StAR-B antibodies 
A. Synthesis

In still another aspect of the invention, the purified StAR-B product is
used to produce anti-StAR-B antibodies which have diagnostic and therapeutic
uses related to the activity, distribution, and expression of the StAR-B product, in
particular therapeutic applications in inhibiting the effect of StAR-B in
cholesterol transport.

Antibodies to StAR-B product may be generated by methods well known
in the art. Such antibodies may include, but are not limited to, polyclonal,
monoclonal, chimeric, humanized, single chain, Fab fragments and fragments
produced by an Fab expression library. Antibodies which inhibit
dimer formation are especially preferred for therapeutic use.

A fragment of the StAR-B product used for antibody induction does not
require biological activity; however, the protein fragment or oligopeptide must
be antigenic. Peptides used to induce
specific antibodies may have an amino acid sequence consisting of at least five
amino acids, preferably at least 10 amino acids, of the sequence specified in SEQ
ID NO:2. Preferably they should mimic a portion of the amino acid sequence of
the natural protein and may contain the entire amino acid sequence of a small,
naturally occurring molecule. Short stretches of StAR-B protein amino acids may
be fused with those of another protein such as keyhole limpet hemocyanin and
antibody produced against the chimeric molecule. Procedures well known in the
art can be used for the production of antibodies to StAR-B product.

For the production of antibodies, various hosts including goats, rabbits,
rats, mice, etc., may be immunized by injection with StAR-B product or any
portion, fragment or oligopeptide which retains immunogenic properties.
Depending on the host species, various adjuvants may be used to increase
immunological response. Such adjuvants include but are not limited to Freund's,
mineral gels such as aluminum hydroxide, and surface active substances such as
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet
hemocyanin, and dinitrophenol. BCG (bacilli Calmette-Guerin) and
Corynebacterium parvum are potentially useful human adjuvants.

Monoclonal antibodies to StAR-B protein may be prepared using any
technique which provides for the production of antibody molecules by continuous
cell lines in culture. These include but are not limited to the hybridoma technique
originally described by Koehler and Milstein (Nature 256:495-497, (1975)), the
human B-cell hybridoma technique (Kosbor et al., Immunol. Today 4:72, (1983);
Cote et al., Proc. Natl. Acad. Sci. 80:2026-2030, (1983)) and the EBV-hybridoma
technique (Cole, et al., Mol. Cell Biol. 62:109-120, (1984)).

Techniques developed for the production of "chimeric antibodies", i.e. the
splicing of mouse antibody genes to human antibody genes to obtain a molecule
with appropriate antigen specificity and biological activity, can also be used
(Morrison et al., Proc. Natl. Acad. Sci. 81:6851-6855, (1984); Neuberger et al.,
Nature 312:604-608, (1984); Takeda et al., Nature 314:452-454, (1985)).
Alternatively, techniques described for the production of single chain antibodies
(U.S. Pat. No. 4,946,778) can be adapted to produce single-chain antibodies
specific for the StAR-B protein.

Antibodies may also be produced by inducing in vivo production in the
lymphocyte population or by screening recombinant immunoglobulin libraries or
panels of highly specific binding reagents as disclosed in Orlandi et al. (Proc.
Natl. Acad. Sci. 86:3833-3837, (1989)), and Winter G. and Milstein C. (Nature
349:293-299, (1991)).

Antibody fragments which contain specific binding sites for StAR-B
protein may also be generated. For example, such fragments include, but are not
limited to, the F(ab')2 fragments which can be produced by pepsin digestion of
the antibody molecule and the Fab fragments which can be generated by reducing
the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression
libraries may be constructed to allow rapid and easy identification of monoclonal
Fab fragments with the desired specificity (Huse W.D. et al., Science
256:1275-1281, (1989)).

B. Diagnostic applications of antibodies 

A variety of protocols for competitive binding or immunoradiometric
assays using either polyclonal or monoclonal antibodies with established
specificities are well known in the art. Such immunoassays typically involve the
formation of complexes between StAR-B product and its specific antibody and
the measurement of complex formation. A two-site, monoclonal-based
immunoassay utilizing monoclonal antibodies reactive to two noninterfering
epitopes on a specific StAR-B product is preferred, but a competitive binding
assay may also be employed. These assays are described in Maddox D.E., et al.
(J. Exp. Med. 158:1211, (1983)).

Antibodies which specifically bind StAR-B product are useful for the
diagnosis of conditions or diseases characterized by over- or under-expression of
StAR-B. Alternatively, such antibodies may be used in assays to monitor patients
being treated with StAR-B product, its activators, or its deactivators. Diagnostic
assays for StAR-B protein include methods utilizing the antibody and a label to
detect StAR-B product in human body fluids or extracts of cells or tissues. The
products and antibodies of the present invention may be used with or without
modification. Frequently, the proteins and antibodies will be labeled by joining
them, either covalently or noncovalently, with a reporter molecule. A wide
variety of reporter molecules are known in the art.

A variety of protocols for measuring StAR-B product, using either
polyclonal or monoclonal antibodies specific for the respective protein, are known
in the art. Examples include enzyme-linked immunosorbent assay (ELISA),
radioimmunoassay (RIA), and fluorescence-activated cell sorting (FACS). As
noted above, a two-site, monoclonal-based immunoassay utilizing monoclonal
antibodies reactive to two non-interfering epitopes on StAR-B product is
preferred, but a competitive binding assay may be employed. These assays are
described, among other places, in Maddox, et al. (supra). Such protocols provide
a basis for diagnosing altered or abnormal levels of StAR-B product expression.
Normal or standard values for StAR-B product expression are established by
combining body fluids or cell extracts taken from normal subjects, preferably human,
with antibody to StAR-B product under conditions suitable for complex
formation which are well known in the art. The amount of standard complex
formation may be quantified by various methods, preferably by photometric
methods. Then, standard values obtained from normal samples may be compared
with values obtained from samples from subjects potentially affected by disease.
Deviation between standard and subject values establishes the presence of a disease
state.
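The comparison of subject values with standard values can be illustrated by the following Python sketch. It is an assumption-laden example rather than a prescribed procedure: the reference range is taken here as the mean of the normal samples plus or minus two sample standard deviations, a common convention that is not stated in this description, and the function names and readings are hypothetical.

```python
# Minimal sketch (assumed statistical convention): compare a subject's
# assay reading with a reference range derived from normal samples.
from statistics import mean, stdev


def reference_range(normal_values, k=2.0):
    """Return (low, high) as mean +/- k sample standard deviations."""
    if len(normal_values) < 2:
        raise ValueError("at least two normal samples are required")
    centre = mean(normal_values)
    spread = stdev(normal_values)
    return centre - k * spread, centre + k * spread


def assess(subject_value, normal_values, k=2.0):
    """Classify a subject reading against the reference range."""
    low, high = reference_range(normal_values, k)
    if subject_value < low:
        return "below normal range"
    if subject_value > high:
        return "above normal range"
    return "within normal range"


if __name__ == "__main__":
    normals = [0.9, 1.0, 1.1, 1.0, 1.0]  # hypothetical photometric readings
    for reading in (0.5, 1.05, 1.5):
        print(reading, assess(reading, normals))
```

The choice of k and the number of normal samples would in practice be fixed by the validation of the particular assay.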

The antibody assays are useful to determine the level of StAR-B present in
a body fluid sample, in order to determine whether it is being overexpressed or
underexpressed in the tissue, or as an indication of how StAR-B levels are
responding to drug treatment.

C. Therapeutic uses of antibodies 

In addition to their diagnostic use, the antibodies may have therapeutic
utility in blocking or decreasing the activity of the StAR-B product in
pathological conditions where a beneficial effect can be achieved by such a
decrease.

The antibody employed is preferably a humanized monoclonal antibody,
or a human Mab produced by known globulin-gene library methods. The
antibody is administered typically as a sterile solution by IV injection, although
other parenteral routes may be suitable. Typically, the antibody is administered
in an amount between about 1-15 mg/kg body weight of the subject. Treatment is
continued, e.g., with dosing every 1-7 days, until a therapeutic improvement is
seen.

Although the invention has been described with reference to specific
methods and embodiments, it is appreciated that various modifications and
changes may be made without departing from the invention.

SEQUENCE LISTING

<110> COMPUGEN LTD.

<120> NOVEL AMINO ACID AND NUCLEIC ACID SEQUENCES

<130> COMPUl

<140>
<141>

<160> 2

<170> PatentIn Ver. 2.0

<210> 1
<211> 1615
<212> DNA
<213> human

<400> 1

ccgcagctaa gcgcactcgc cgacgcaatg gacccggcgc tggcagccca gatgagcgag 60
gctgtggcca gaagatgctc cagtaccggc gggacacagc aggctggaag atttgccggg 120
aaggcaangg agtttcagtt tcctggaggc catctgtgga gtttccaggg aacctgtacc 180
gaggagaagg cattgtatat gggacactag aggaggtgtg ggactgtgtg aagccagctg 240
ttggaggcct acgagtgaag tgggatgaga atgtgaccgg ttttgaaatt atccaaagca 300
tcactgacac cctgtgtgta agcagaacct ccactccctc cgctgccatg aagctcattt 360
ctcccagaga ttttgtggac ttggtgctag tcaagagata tgaggatggg accatcagtt 420
ccaacgccac ccatgtggag catccgttat gtcccccgaa gccaggtttt gtgagaggat 480
ttaaccatcc ttgtggttgc ttctgtgaac ctcttccagg ggaacccacc aagaccaacc 540
tggtcacatt cttccatacc gacctcagcg gttacctccc acagaacgtg gtggactcct 600
tcttcccccg cagcatgacc cggttttatg ccaaccttca gaaagcagtg aagcaattcc 660
atgagtaatg ctatcgttac ttcttggcaa agaactcccg tgactcatcg aggagctcca 720
gctgttggga caccaaggag cctgggagca cgcagaggcc tgtgttcact ctttggaaca 780
agctgatgga ctgcgcatct ctgagaatgc caaccagagg cggcagccca cccttcctgc 840
ctcctgcccc actcagggtt ggcgtgtgat gagccattca tgtgttccaa actccatctg 900
cctgttaccc aaacacgcct ctcctggcag ggtagaccca ggcctctaac catctgacag 960
agactcggcc tggacaccat gcgatgcact ctggcaccaa ggctttatgt gcccatcact 1020
ctcagagacc acgtttccct gactgtcata gagaatcatc atcgccactg aaaaccaggc 1080
cctgttgcct tttaagcatg taccgctccc tcagtcctgt gctgcagccc cccaaatata 1140
tttttctgat atagaccttg tatatggctt taatgccgca aaatatttat ttttccttaa 1200
aaaaggtgtc aacttggaaa taatggttta aaaacaggat aagcattaag gaaaaacact 1260
ttcaatgtgt cttccatttg atgaatttgt ttttctctct ttatccccgc aagtggagtt 1320
tcatgtcctc ggtgaaacca gacagtgtga atctgttcca gcccaaatct gcagcattag 1380
ggatgagttc tcrgaagtga ttctgaactg agcacgcact catgtctgca tggggaactc 1440
tggggagaag agccttcctt ttctttccct tgggccattt gcctttcctt gtcgtcttac 1500
tgagggcgga ggcagggagg gtctctgtct ttccagggcc ctgggcaggg ccatcctggc 1560
cattcaggga aagatgggaa gagttagggc tccgttttag gcagcctggg tggga 1615

<210> 2
<211> 221
<212> PRT
<213> human

<400> 2

Ala Ala Lys Arg Thr Arg Arg Arg Asn Gly Pro Gly Ala Gly Ser Pro
1 5 10 15

Asp Glu Arg Gly Cys Gly Gln Lys Met Leu Gln Tyr Arg Arg Asp Thr
20 25 30

Ala Gly Trp Lys Ile Cys Arg Glu Gly Xaa Gly Val Ser Val Ser Trp
35 40 45

Arg Pro Ser Val Glu Phe Pro Gly Asn Leu Tyr Arg Gly Glu Gly Ile
50 55 60

Val Tyr Gly Thr Leu Glu Glu Val Trp Asp Cys Val Lys Pro Ala Val
65 70 75 80

Gly Gly Leu Arg Val Lys Trp Asp Glu Asn Val Thr Gly Phe Glu Ile
85 90 95

Ile Gln Ser Ile Thr Asp Thr Leu Cys Val Ser Arg Thr Ser Thr Pro
100 105 110

Ser Ala Ala Met Lys Leu Ile Ser Pro Arg Asp Phe Val Asp Leu Val
115 120 125

Leu Val Lys Arg Tyr Glu Asp Gly Thr Ile Ser Ser Asn Ala Thr His
130 135 140

Val Glu His Pro Leu Cys Pro Pro Lys Pro Gly Phe Val Arg Gly Phe
145 150 155 160

Asn His Pro Cys Gly Cys Phe Cys Glu Pro Leu Pro Gly Glu Pro Thr
165 170 175

Lys Thr Asn Leu Val Thr Phe Phe His Thr Asp Leu Ser Gly Tyr Leu
180 185 190

Pro Gln Asn Val Val Asp Ser Phe Phe Pro Arg Ser Met Thr Arg Phe
195 200 205

Tyr Ala Asn Leu Gln Lys Ala Val Lys Gln Phe His Glu
210 215 220
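The correspondence between SEQ ID NO:1 and SEQ ID NO:2 can be checked with a short Python sketch, offered here as an illustration only. It assumes the standard genetic code and a reading frame beginning at the third base of SEQ ID NO:1 (an offset of 2), which is the frame in which the listed nucleotides yield the listed residues; codons containing an ambiguous base such as n are reported as X, matching the Xaa entry. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Minimal sketch (standard genetic code assumed): translate a DNA
# fragment into one-letter amino acid codes from a chosen offset.

_BASES = "tcag"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def translate(dna: str, offset: int = 0) -> str:
    """Translate `dna` from `offset`, stopping at the first stop codon."""
    seq = "".join(dna.split()).lower()  # remove listing spaces
    residues = []
    for pos in range(offset, len(seq) - 2, 3):
        # Codons with ambiguous bases are absent from the table -> "X".
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


if __name__ == "__main__":
    first_50 = "ccgcagctaa gcgcactcgc cgacgcaatg gacccggcgc tggcagccca"
    print(translate(first_50, offset=2))  # AAKRTRRRNGPGAGSP
```

The printed string corresponds to residues 1-16 of SEQ ID NO:2 (Ala Ala Lys Arg Thr Arg Arg Arg Asn Gly Pro Gly Ala Gly Ser Pro) in one-letter code.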
CLAIMS

1. An isolated nucleic acid sequence selected from the group consisting of:

(i) the nucleic acid sequence depicted in SEQ ID NO:1;

(ii) nucleic acid sequences having at least 70% identity with the
sequence of (i); and

(iii) fragments of (i) or (ii) of at least 20 b.p.

2. A nucleic acid sequence according to Claim 1(ii), wherein the nucleic acid
sequences have at least 80% identity with the sequence of Claim 1(i).

3. A nucleic acid sequence according to Claim 2, wherein the nucleic acid
sequences have at least 90% identity.

4. An isolated nucleic acid sequence complementary to the nucleic acid sequence
of Claim 1.

5. An amino acid sequence selected from the group consisting of:

(i) an amino acid sequence coded by the isolated nucleic acid sequence
of Claim 1;

(ii) fragments of the amino acid sequence of (i) having at least 10 amino
acids; and

(iii) analogues of the amino acid sequences of (i) or (ii) in which one or
more amino acids has been added, deleted, replaced or chemically modified
without substantially altering the biological activity of the parent amino acid
sequence.

6. An amino acid sequence according to Claim 5, as depicted in SEQ ID NO:2. 

7. An isolated nucleic acid sequence coding for the amino acid sequence of Claim 
5. 

8. A purified antibody which binds specifically to the amino acid sequence of
Claim 5.

9. An expression vector comprising the nucleic acid sequences of Claim 1 and
control elements for the expression of the nucleic acid sequence in a suitable host.

10. An expression vector comprising the nucleic acid sequence of Claim 4, and
control elements for the expression of the nucleic acid sequence in a suitable host.

11. A host cell transfected by the expression vector of Claim 9. 

12. A pharmaceutical composition comprising a pharmaceutically acceptable
carrier and as an active ingredient an agent selected from the group consisting of:

(i) the expression vector of Claim 9; and

(ii) the amino acid sequence of Claim 5.

13. A pharmaceutical composition according to Claim 12, for treatment of
diseases which can be ameliorated or cured by raising the level of the steroidogenic
acute regulatory protein homolog B (StAR-B).

14. A pharmaceutical composition comprising a pharmaceutically acceptable
carrier and as an active ingredient an agent selected from the group consisting of:

(i) the nucleic acid sequence of Claim 4;

(ii) the expression vector of Claim 10; and

(iii) the purified antibody of Claim 8.

15. A pharmaceutical composition according to Claim 14, for treatment of
diseases which can be ameliorated or cured by decreasing the level of the StAR-B
product.

16. A pharmaceutical composition according to Claim 12, for regulating the
levels of steroids.

17. A pharmaceutical composition according to Claim 16 for regulating the
levels of androgen.

18. A method for detecting a StAR-B nucleic acid sequence in a biological
sample, comprising the steps of:

(a) hybridizing to nucleic acid material of said biological sample a
nucleic acid sequence of Claim 1; and

(b) detecting said hybridization complex;

wherein the presence of said hybridization complex correlates with the
presence of a StAR-B nucleic acid sequence in the said biological sample.

19. A method according to Claim 18, wherein the nucleic acid material of said
biological sample is mRNA transcripts.

20. A method according to Claim 18, wherein the nucleic acid sequence is present
in a nucleic acid chip.

21. A method for identifying candidate compounds capable of binding to the
StAR-B product and modulating its activity, the method comprising:

(i) providing a protein or polypeptide comprising an amino acid
sequence substantially as depicted in SEQ ID NO:2, or a fragment of such a
sequence;

(ii) contacting a candidate compound with said amino acid sequence;

(iii) determining the effect of said candidate compound on the biological
activity of said protein or polypeptide and selecting those compounds which show a
significant effect on said biological activity.

22. A method according to Claim 21, wherein the compound is an activator and
the measured effect is an increase in the biological activity.

23. A method according to Claim 21, wherein the compound is a deactivator
and the effect is a decrease in the biological activity.

24. An activator of the amino acid sequence of Claim 5.

25. A deactivator of the amino acid sequence of Claim 5.

26. A method for detecting StAR-B product in a biological sample, comprising
the steps of:

(a) contacting with said biological sample the antibody of Claim 8,
thereby forming an antibody-antigen complex; and

(b) detecting said antibody-antigen complex;

wherein the presence of said antibody-antigen complex correlates with the
presence of StAR-B product in said biological sample.
[Nucleotide sequence of 1615 bases, numbered in blocks of 60, corresponding to SEQ ID NO:1 of the Sequence Listing]

Fig. 1
[Amino acid sequence of 221 residues, corresponding to SEQ ID NO:2 of the Sequence Listing, shown over two sheets]

Fig. 2
(57 9669) steroidogenic acute regulatory protein, StAR [human, follicle cells,
Peptide Mitochondrial, 285 aa] [Homo sapiens] gi|2661026 (AF035277) steroidogenic
acute regulatory protein [Homo sapiens]

alignment_scores:
Quality: 169.00 Length: 203
Ratio: 1.420 Gaps: 6
Percent Similarity: 58.621 Percent Identity: 26.108

alignment_block:
cchll0_t53767_0.pep x gi|1172250|bbs|171384

Align seg 1/1 to: gi|1172250|bbs|171384 from: 1 to: 285

[Residue-by-residue alignment of residues 23-217 of the StAR-B sequence (SEQ ID NO:2) with residues 84-276 of human StAR; the alignment rows are not legibly reproduced]

Fig. 3